TJU GSummary at TAC2011: Category oriented extractive content selection for guided summarization

نویسندگان

  • Ruifang He
  • Kang Fu
چکیده

The guided summarization aims to capture evolving information of a single topic changing over time under the background of much emergency happenings. It delivers salient and novel information to a user who has already read a set of older documents covering the same topic. Topic is category oriented, however, there is no query description. Therefore, guided summarization raises new challenges. In this paper, the category oriented extractive content selection method for guided summarization is proposed, which is completely language independence. Meanwhile, we submitted two systems. Our systems rank top 5 under PYRAMID metrics among 48 running systems. However, we rank middle under ROUGE and BE averagely. Our methods show the great difference for different evaluation metrics. Therefore, we try to think why this happens? What is the best method? What is the best evaluation metrics?

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploiting Category-Specific Information for Multi-Document Summarization

We show that by making use of information common to document sets belonging to a common category, we can improve the quality of automatically extracted content in multi-document summaries. This simple property is widely applicable in multi-document summarization tasks, and can be encapsulated by the concept of category-specific importance (CSI). Our experiments show that CSI is a valuable metri...

متن کامل

SWING: Exploiting Category-Specific Information for Guided Summarization

We present our work towards building a robust multiple document summarizer (SWING), with a focus on guided summarization. SWING is an extractive summarizer built upon information retrieval principles. Our key contribution is utilizing category knowledge, collected over all news topics, to calculate category-specific importance (CSI) of sentences. We propose two new category-specific features in...

متن کامل

Who wrote What Where: Analyzing the content of human and automatic summaries

Abstractive summarization has been a longstanding and long-term goal in automatic summarization, because systems that can generate abstracts demonstrate a deeper understanding of language and the meaning of documents than systems that merely extract sentences from those documents. Genest (2009) showed that summaries from the top automatic summarizers are judged as comparable to manual extractiv...

متن کامل

Decayed DivRank for Guided Summarization

Guided summarization is essentially an aspect-based multi-document summarization, where aspects can be taken as specified queries in summarization. We proposed a novel ranking algorithm, Decayed DivRank (DDRank) for guided summarization tasks of TAC2011. DDRank can address relevance, importance, diversity, and novelty simultaneously through a decayed vertex-reinforced random walk process in sen...

متن کامل

Empirical analysis of exploiting review helpfulness for extractive summarization of online reviews

We propose a novel unsupervised extractive approach for summarizing online reviews by exploiting review helpfulness ratings. In addition to using the helpfulness ratings for review-level filtering, we suggest using them as the supervision of a topic model for sentence-level content scoring. The proposed method is metadata-driven, requiring no human annotation, and generalizable to different kin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011